A Simple Statistical Cache Sharing Model for Multicores
نویسندگان
چکیده
The introduction of multicores has made analysis of shared resources, such as shared caches and shared DRAM bandwidth, an important topic to study. We present two simple, but accurate, cache sharing models that use high-level data that can easily be measured on existing systems. We evaluate our model using a simulated multicore processor with four cores and a shared L2 cache. Our evaluation shows that we can predict average sharing in groups of four benchmarks with an average error smaller than 0.79% for random caches and 1.34% for LRU caches.
منابع مشابه
Efficient Resource Oblivious Algorithms for Multicores
We consider the design of efficient algorithms for a multicore computing environment with a global shared memory and p cores, each having a cache of size M , and with data organized in blocks of size B. We characterize the class of ‘Hierarchical Balanced Parallel (HBP)’ multithreaded computations for multicores. HBP computations are similar to the hierarchical divide & conquer algorithms consid...
متن کاملOptimizing Performance in Highly Utilized Multicores with Intelligent Prefetching
Khan, M. 2016. Optimizing Performance in Highly Utilized Multicores with Intelligent Prefetching. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1335. 54 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-554-9450-6. Modern processors apply sophisticated techniques, such as deep cache hierarchies and hardware prefetching, to increase pe...
متن کاملCSHARP: Coherence and SHaring Aware Replacement Policies for Parallel Applications
Parallel applications are becoming mainstream and architectural techniques for multicores that target these applications are the need of the hour. Sharing of data by multiple threads and issues due to data coherence are unique to parallel applications. We propose CSHARP, a hardware framework that brings coherence and sharing awareness to any shared last level cache replacement policy. We use th...
متن کاملRethinking Last-Level Cache Management for Multicores Operating at Near-Threshold Voltages
Near-threshold voltage computing promises an order of magnitude improvement in energy efficiency, enabling future processors to integrate 100s of cores running concurrently. However, such low voltage operation accompanies extreme parametric variations, resulting in unreliable operation of the processor. The memory bit-cells in on-chip caches are most vulnerable to failure due to their tight fun...
متن کاملA Work Stealing Scheduler for Parallel Loops on Shared Cache Multicores
Reordering instructions and data layout can bring significant performance improvement for memory bounded applications. Parallelizing such applications requires a careful design of the algorithm in order to keep the locality of the sequential execution. In this paper, we aim at finding a good parallelization of memory bounded applications on multicore that preserves the advantage of a shared cac...
متن کامل